klotz: machine learning* + data science*

0 bookmark(s) - Sort by: Date ↓ / Title / - Bookmarks from other users for this tag

  1. The article explores the concept of Retrieval-Augmented Generation (RAG) using SQLite, specifically with the sqlite-vec extension and the OpenAI API. It outlines a simplified approach to RAG, moving away from complex frameworks and cloud vector databases, using SQLite's virtual tables for vector search and semantic understanding.

  2. A comprehensive guide to Large Language Models by Damien Benveniste, covering various aspects from transformer architectures to deploying LLMs.

    • Language Models Before Transformers
    • Attention Is All You Need: The Original Transformer Architecture
    • A More Modern Approach To The Transformer Architecture
    • Multi-modal Large Language Models
    • Transformers Beyond Language Models
    • Non-Transformer Language Models
    • How LLMs Generate Text
    • From Words To Tokens
    • Training LLMs to Follow Instructions
    • Scaling Model Training
    • Fine-Tuning LLMs
    • Deploying LLMs
  3. The article discusses methods for data scientists to answer 'what if' questions regarding the impact of actions or events without having conducted prior experiments. It focuses on creating counterfactual predictions using machine learning techniques and compares a proposed method with Google's Causal Impact. The approach involves using historical data and control groups to estimate the effect of modifications, addressing challenges such as seasonality, confounders, and temporal drift.

  4. This article provides an overview of feature selection in machine learning, detailing methods to maximize model accuracy, minimize computational costs, and introduce a novel method called History-based Feature Selection (HBFS).

  5. This article provides a non-technical guide to interpreting SHAP analyses, useful for explaining machine learning models to non-technical stakeholders, with a focus on both local and global interpretability using various visualization methods.

  6. A guide on how to use OpenAI embeddings and clustering techniques to analyze survey data and extract meaningful topics and actionable insights from the responses.

    The process involves transforming textual survey responses into embeddings, grouping similar responses through clustering, and then identifying key themes or topics to aid in business improvement.

  7. PCA (principal component analysis) can be effectively used for outlier detection by transforming data into a space where outliers are more easily identifiable due to the reduction in dimensionality and reshaping of data patterns.

  8. A deep dive into time series analysis and forecasting methods, providing foundational knowledge and exploring various techniques used for understanding past data and predicting future outcomes.

  9. A detailed overview of the architecture, Python implementation, and future of autoencoders, focusing on their use in feature extraction and dimension reduction in unsupervised learning.

  10. Support Vector Machine (SVM) algorithm with a focus on classification tasks, using a simple 2D dataset for illustration. It explains key concepts like hard and soft margins, support vectors, kernel tricks, and optimization probles.

Top of the page

First / Previous / Next / Last / Page 1 of 0 SemanticScuttle - klotz.me: Tags: machine learning + data science

About - Propulsed by SemanticScuttle